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Abstract — Existing coupling metrics only use the number of 
methods invocations, and does not consider the weight of the 
methods. Thus, they cannot measure coupling metrics 
accurately. In this paper, we measure the weight of methods 
using PageRank algorithm, and propose a new approach to 
improve coupling metrics using the weight. We validate the 
proposed approach by applying them to several open source 
projects. And we measure several coupling metrics using 
existing approach and proposed approach. As a result, the 
correlation between change-proneness and improved coupling 
metrics were significantly higher than existing coupling 
metrics. Hence, our improved coupling metrics can more 
accurately measure software. 

Index Terms — software quality metrics, coupling metrics, 
PageRank algorithm 

I. Introduction 

Software quality measurement is essential in software 
engineering. Coupling of software quality metrics is "the 
measure of the strength of association established by a 
connection from one module to another" [1]. Coupling is 
used for various purposes, such as modularization, reusability, 
fault-proneness, and change-proneness. In this paper, we 
propose a new approach to improve coupling metrics. Existing 
coupling metrics mainly use the number of method 
invocations [2]. These coupling metrics does not consider 
the weight of methods; assume that all methods have same 
weight. But since there are any methods that are more 
important than other methods, considering the weight of 
methods is more accurate. In this paper, we measure the weight 
of methods using PageRank algorithm, and propose a new 
approach to improve coupling metrics using the weight. We 
validate the proposed approach by applying them to several 
open source projects. And we measure several coupling 
metrics using existing approach and proposed approach. As 
a result, the correlation between change-proneness and 
improved coupling metrics were significantly higher than 
existing coupling metrics. We observed that improved 
coupling metrics could more accurately measure software 
quality than existing coupling metrics. The rest of the paper 
is organized as follows. Section 2 describes the existing 
coupling metrics; In Section 3, we outline our new approach 
to measure the weight of methods using PageRank algorithm 
for improve coupling metrics. Section 4 presents a case study 
as an empirical evaluation of the proposed approach. Section 
5 concludes the paper and outlines future research. 
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II. Related Work and Background 

A. Structural Coupling Metrics 

Existing coupling metrics only consider the structural 
relationship of software. CBO and COF measures coupling 
using call relationship between classes [3, 4, 5] . RFC measures 
coupling using the number of methods of a class and called 
methods of a class [3, 4]. MPC measures coupling using the 
number of called methods of a class [6]. DAC measures 
coupling using the number of class variables defined in a 
class [6]. In this paper, we improve four coupling metrics 
(CBO, MPC, RFC, and COF). Since these coupling metrics is 
directly consider the number of methods invocations, we 
select these metrics. 

B. Semantic Coupling Metrics 

Structural metrics only measures coupling from a data- 
flow perspective. Semantic metrics can measure coupling from 
a semantic perspective, such as functional. CoCC (Conceptual 
Coupling of a Class) measures coupling using conceptual 
similarity between elements of source code [7]. RTC (Relational 
Topic based Coupling) measure coupling using Relational 
Topic Models, generative probabilistic to capture latent 
topics in source code [8]. Since semantic metrics is only 
applicable to source code, these are not applicable to artifacts 
(i.e., method call graph) in design phase. In this paper, we 
introduce a new approach to improve coupling metrics that 
are applicable to artifacts in design phase. 

C. PageRank Algorithm 

PageRank is link analysis algorithm that assigns a 
numerical weighting to each element of a hyperlinked set of 
documents, such as the World Wide Web [9]. In this paper, 
we measure weight of methods using PageRank algorithm. 
Equation (1) shows that calculation method of PageRank 
algorithm. 

PR(A) = (1-d) + d(PR(T 1 )/C(T 1 )+...+PR(T )/C(T )) (1) 
d = 0.85 

C(T.) : the number of outbound links on page T. 

Figure 1 is webpage link graph that is generated randomly. 
This graph contains the nodes (webpages) and edges (link 
relationship between webpages). The initial value of 
PageRank for each page is assigned to 0. 




Figure 1. Webpage link graph 
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Since the calculation is an iterative process, the algorithm is 
stopped after a given count of iterations or if the PageRank 
value differences between iterations are less than predefined 
values. 

TABLE I. Results of PageRank Algorithm 



iterate 


PR(A) 


PR(B) 


PR(C) 


1 


015 


0.2S 


0.27 




0.27 


0.3S 


0.31 




031 


041 


0.33 


4 


0.33 


043 


0.33 


i" 


0.33 


043 


0.33 



III. Proposed Approach 

In this section, we measure the weight of methods using 
PageRank algorithm, and improve four coupling metrics using 
the measured weight of methods. 

A. Measurement of Methods Weight 

In order to measure coupling metrics, figure 2 is method 
call graph generated randomly. This graph consists of 
nodes(methods) and edges(call relationship between 
methods). 




Figure 2. Method call graph 

We applied Pagerank algorithm to this method call graph. 
Table 2 shows that weight of five methods were computed 
based on PageRank algorithm. 



TABLE II. Weight of Methods 



PR(Ml> 




PRflvL) 


PR(M*) 


PR(tvI=) 
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090 


O.SS 


030 


0.33 



Table 3 shows methods of each class. We can construct class 
call graph using Figure2 and Table 3. Figure 3 shows class 
call graph. 

TABLE III. Methods of Each Class 





methods 


Cl 


M L , Mj 


c 2 


Ma.M* 


G 


Ms 




Figure 3. Class call graph 

Table 4 shows that weight of three classes were computed 
based on PageRank algorithm. As mentioned above, we can 
measure weight of methods and classes using PageRank 
algorithm. 

TABLE IV. Weight of Classes 



PR(C t > 


PR(C) 


PR(C a ) 


130 


100 


0.70 



A. Improvement of Coupling Metrics 

In this section, coupling metrics can be described by 
following equations. When we use suffix 'N' in each metrics, 
it means improved cohesion metrics. 

1) CBO 

CB0(c) = \SIC(c)\ 
CBOM = ^ Pfl(c') 



SIC^c"} '■ The set of called classes of class 'c' 
PR( e) '■ PageRank of class 'c' 
2JMPC 

MPC(_c) = \SIM(c)\ 

MPC jV (c)= £ PR(m') 



(2) 
(3) 



(4) 
(5) 



7JI' esjAr( e} 

SIM[c~) '■ The set of called methods of class 'c' 
Pi?(m.] : PageRank of method 'm' 

3) RFC 

RFC(c) = \SIM(c)\ +M 
RFC N (c) = V PR(m') + ^ Pff(m,) 

ni'ES/Mlic} i = l 

it : The number of methods of class 'c' 

4) CFO 

\SIC(c)\ 



(6) 
(7) 



COF(c) - 



COF M (c) = 



c 2 — C 



c 2 -c 

C '■ The number of all classes 
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Table V. Open Source Projects 



Prcjsit 


Used version 


Des;rLpticn 


Class ss 


LOC 


Changed LOC 


Robocods 




ProHiammins sum 


166 


39991 


3659 


rHotDrsT.v 


7.1—^7.5 


Java GUI franis^ ctk 


150 


29667 


340S 


JsbRe: 


i.0^2.7 


Biblic •Erapb.y reference manager 




291 32 


1232 


HsritrLx 


1.0-*1.4 


'vVet crawler 




5S2SS 


3712 



IV. Case Study 

We validate the proposed approach by applying them to 
four open source projects. Table 5 gives a description of four 
open source project. 

A. Experimental Process 

1) Java parser generates class information. And we can 
measure coupling metrics of each class using created class 
information. 

2) Then we count the number of changed lines in each 
class. And we computed change-proneness using the number 
of changed lines. In this paper, we define change-proneness 
as equation (10) that is independent of source code size. 



Change proneness 



number of changed lines 
number of all lines 



(10) 



3) Finally, we computed Pearson's correlation coefficient 
between change-proneness and coupling metrics, for each 
class. Generally, coupling metrics is used as predictor of the 
change-proneness of a class [10, 11, 12]. Therefore, in order 
to empirically confirm superiority of improved coupling 
metrics, we computed correlation coefficient between change- 
proneness and coupling metrics. 

B. Experimental Results 

We measured four coupling metrics of each class with 
existing and proposed approach. And we compute the 
correlation coefficient between change-proneness and each 
coupling metrics. We show the results of this experiment in 
figure 4. In this figure 4, the first bar represents the correlation 
coefficient between change-proneness and existing coupling 
metrics, and the second bar represents the correlation 
coefficient between change-proneness and improved 
coupling metrics. In four open source projects, we observed 
that the correlation coefficient of all coupling metrics has 
been improved. Our improved coupling metrics showed 
significantly an improved correlation coefficient as compared 
to the existing coupling metrics. Thus, improved coupling 
metrics proved to be better predictors than existing coupling 
metrics. It can be concluded that improved coupling metrics 
is the better metrics. 
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(d) Heritrix 1.0 -* 1.4 
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V. Conclusion And Future Work 

In this paper, we measure the weight of methods using 
PageRank algorithm, and proposed a new approach to 
improve coupling metrics using the weight. We applied 
improved coupling metrics to four open source projects, 
measured the correlation coefficient between change- 
proneness and coupling metrics, and empirically observed 
superiority of proposed approach. In the future work, we will 
explore new approach to measure the weight of source code 
lines, and improve complexity metrics using the weight. 
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